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ABSTRACT Bacteria and eukaryotes are involved in many types of interaction in nature, with innportant 
ecological consequences. However, the diversity, occurrence, and mechanisms of these interactions often 
are not fully known. The obligate bacterial endosymbionts of Paramecium provide their hosts with the ability 
to kill sensitive Paramecium strains through the production of R-bodies, highly insoluble coiled protein 
ribbons. R-bodies have been observed in a number of free-living bacteria, where their function is unknown. 
We have performed an exhaustive survey of genes coding for homologs of Reb proteins (R-body 
components) in complete bacterial genomes. We found that reb genes are much more widespread than 
previously thought, being present in representatives of major Proteobacterial subdivisions, including many 
free-living taxa, as well as taxa known to be involved in various kinds of interactions with eukaryotes, from 
mutualistic associations to pathogenicity. Reb proteins display very good conservation at the sequence 
level, suggesting that they may produce functional R-bodies. Phylogenomic analysis indicates that reb 
genes underwent a complex evolutionary history and allowed the identification of candidates potentially 
involved in R-body assembly, functioning, regulation, or toxicity. Our results strongly suggest that the ability 
to produce R-bodies is likely widespread in Proteobacteria. The potential involvement of R-bodies in as yet 
unexplored interactions with eukaryotes and the consequent ecological implications are discussed. 
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During more than two billion years of coexistence, prokaryotes have 
established various forms of interaction with eukaryotes. Examples 
include the mutualistic symbioses that benefit eukaryotic host by pro- 
viding nutrients, defense, competition, and adaptation to new environ- 
ments (Gast et al. 2009). At the same time, bacteria have developed 
various ways to defend themselves against grazing by eukaryotes 
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(Matz and KjeUeberg 2005), with potential implications for the emer- 
gence of pathogens (Brown and Barker 1999). However, the extent 
and diversity of bacterial/eukaryotic interactions in nature remains 
largely underexplored. As a growing amount of genomic data covering 
a large fraction of bacterial diversity becomes available, hints of such 
relationships may be gathered from in silico analyses. These can be 
linked to experimental observations, providing useful directions for 
further work. 

A fascinating example of a bacterial/eukaryote relationship is 
provided by the killer endosymbionts of the ciliate Paramecium. In 
the 1930s Tracey Sonneborn discovered that some strains of the 
Paramecium aurelia complex have a killer phenotype toward sensitive 
strains (Beale and Freer 2004; Freer 2006; Sonneborn 1938). Sonne- 
born could show that this phenomenon is not controlled by nuclear 
genes, providing one of the first examples of cytoplasmic inheritance 
(Sonneborn 1943). It was later discovered that the killer phenotype is 
conveyed by an obligate endosymbiotic bacterium (also referred to as 
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a kappa particle), and each killer paramecium strain harbors its own 
specific endosymbiont that usually resides in the cytoplasm but can 
also be in the nucleus (Freer 1975). Whereas all these obligate endo- 
symbionts initially were placed in the genus Caedibacter, they were 
later shown to belong to different proteobacterial lineages (Beier et al 
2002). 

In Caedibacter, the killer trait is directly linked to the production 
of R-bodies, unusual cytoplasmic refi-actile inclusion bodies (Dilts and 
Quackenbush 1986) [for review, see (Pond et al 1989)]. What is 
known about R-bodies comes primarily from Caedibacter taeniospiralis, 
which belongs to the gammaproteobacteria family of Thiotrichales 
(Figure 1) (Beier et al. 2002). R-bodies are highly insoluble protein 
ribbons that are typically coiled into cylindrical structures. They are 
produced by only a fraction of the endosymbiont population, which 
then stop dividing. When R-body— containing bacteria are released 
into the environment and captured by sensitive strains, killing 
occurs very rapidly. The exact mechanism for killing is not known, 
but it is thought that internalization of the R-body- containing bac- 
teria into the food vacuole triggers unrolling of the R-body, which 
penetrates the phagosomal membrane and delivers a killer toxin to 
the cytoplasm (Figure 1) (Jurand et al. 1971; Freer et al. 1974). 
Isolated R-body— containing bacteria are capable of killing sensitive 
Paramecium strains in various ways, with prelethal symptoms such 
as paralysis, vacuolization, and opposite swimming rotation. On the 
contrary, exposure to non-R-body— containing Caedibacter is not 
lethal [for review, see (Fond et al. 1989)], and a mutant Caedibacter 
strain unable to make R-bodies loses its killing ability (Dilts and 
Quackenbush 1986). Different strains of Caedibacter produce differ- 
ent types of R-bodies, which vary in diameter (0.25 — 0.8 mm), length 
(<10 — 30 mm), ribbon morphology (tapered or blunt termini), 
mode of unrolling (from the outside or from the inside in a telescopic 
fashion), and the nature of the stimulus for unrolling (changes in pH, 
temperature, ionic strength) [for review, see (Fond et al. 1989; Sanchez- 
Amat 2006)]. Interestingly, the unrolling of type 51 R-bodies has been 
shown to be reversible (they unroll when the pH is dropped <6.5 and 
reroU when the pH is again raised >7.0) (Fond et al. 1989). 



Studies of the genetic determinants of R-bodies began in the early 
1980s, and in C. taeniospiralis the R body— coding region was found 
to lie on a plasmid (Quackenbush and Burbach 1983). When a region 
from the pKAF47 plasmid (from Paramecium teraurelia California 
strain 47) was cloned into Escherichia coli, R-bodies were produced, 
but the clones did not exhibit toxicity toward sensitive Paramecium 
strains (Quackenbush and Burbach 1983). Therefore, whereas pro- 
duction of R-bodies is necessary for killing by C. taeniospiralis, as 
described previously, it is not sufficient for killing by recombinant 
Escherichia coli. This excludes a direct cytotoxic effect of R-bodies 
and indicates a requirement for an essential unknown toxin encoded 
either by the plasmid or the C. taeniospiralis genome (Freer and Stark 
1953; Quackenbush and Burbach 1983). These data have been recently 
confirmed, and it has been shown that recombinant E. coli expressing 
the four reb genes of the C. taeniospiralis pKAF298 plasmid (from 
Paramecium teraurelia Panama strain 298) were capable of producing 
R-bodies but were not toxic toward sensitive Paramecium strains 
(SchraUhammer et al. 2012). 

Very little information is available on the assembly process of R- 
bodies. At least three polypeptides of 10, 13, and 18 kDa were found to 
be involved in the structure and assembly of type 51 R-bodies 
(Kanabrocki et al. 1986) and were later shown to be encoded by three 
genes: rebC, rebB, and rebA, respectively, the last two being homolo- 
gous (Heruth et al. 1994). These early data proposed that the major 
structural protein is RebB and that RebA may act as a scaffold to 
facilitate the polymerization process whereas RebC may act as a tran- 
scriptional regulator (Heruth et al. 1994). Finally, it was suggested that 
RebB might be modified posttranslationally, with the possible involve- 
ment of RebC (Heruth et al. 1994). The role of a fourth gene in the reb 
locus, rebDy coding for a homolog of RebA and RebB, is unclear but it 
was shown not to be necessary for R-body production in E. coli 
(Heruth et al. 1994). 

The complete sequence of the Reb-harboring pKAF298 plasmid of 
C. taeniospiralis strain 298 was obtained in 2005 (Jeblick and Kusch 
2005). It was found that this plasmid contains 63 open reading frames, 
23 only having similarity with proteins with known function, and 



Killer endosymbiont with R-bodies 




Figure 1 Illustration of the C. taeniospiralis R- 
body toxin delivery system (see main text for 
details and references). 
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released 
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a few being similar to proteins encoded by phages or prophages, 
which led to the suggestion that the plasmid originated from a bacte- 
riophage (Jeblick and Kusch 2005), which is consistent with early 
observations of the association of phage-like particles with R-bodies 
(Freer et al 197 A). A protein with homology to the Soj-ParA family of 
membrane-associated ATPases was suggested as a possible candidate 
for the toxin, which would kill the host by somehow affecting its 
membrane, although a precise mechanism was not proposed (Jeblick 
and Kusch 2005). 

The harboring of an endosymbiont that produces R-bodies gives 
a competitive advantage to its killer Paramecium host with respect to 
sensitive strains (Kusch et al 2002). In turn, R-body production seems 
to play a role in defense against predation and creates a benefit for the 
Caedibacter strains at the population level (Sanchez- Amat 2006). 
However, many important questions remain to be clarified. For ex- 
ample, it is not known how obligate symbiosis is established in the 
first place or how sensitive strains can pick up Caedibacter and be- 
come killers, nor how killer Paramecium strains are protected from 
their own specific R-body producing endosymbionts (Gibson 1973; 
Preer et al 1974). 

Interestingly, casual observations of coiled R-body structures of 
various types have been reported in several free-living bacteria: the 
hydrogen-oxidizing P-proteobacterium Pseudomonas taeniospiralis 
(Lalucat and Mayer 1978), now known as Hydrogenophaga taeniospir- 
alis; the soil (B-proteobacterium Pseudomonas avenae (Wells and 
Horne 1983), now known as Acidovorax avenae subsp. avenae; the 
soil strain Pseudomonas sp. EPS-5028 (Fuste et al 1986); the anoxi- 
genic photosynthetic N2-fixing a-proteobacterium Rhodospirillum 
centenum (Favinger et al 1989); the soil strain Pseudomonas aerugi- 
nosa 44T1 (Espuny et al 1991); and the melanin-producing marine 
7-proteobacterium Marinomonas mediterranea (Hernandez-Romero 
et al 2003). However, no further study on these R-body structures has 
been reported for any of these species [for review, see (Sanchez- Amat 
2006)], nor have they been linked to the presence of Reb homologs in 
their genomes. Therefore, the role of these R-bodies in these diverse 
bacterial remains puzzling. A recent study has shown the presence of 
Reb homologs in the genome of the rhizobiale Azorhizobium cauli- 
nodansy a microsymbiont of the tropical legume Sesbania rostrata 
(Akiba et al 2010). Interestingly, deletion of the putative transcription 
factor praR caused aberrant nodule formation and was linked to 
greater expression of the reb locus. On the contrary, a double reb 
and praR mutant had a restored wild-type nodule formation. The 
authors hypothesized that praR is essential to suppress the killer trait 
conferred by the reb locus and establish symbiosis between A. cauli- 
nodans and S. rostrata (Akiba et al 2010). However, it is not known 
whether A. caulinodans is able to make R-bodies. The authors also 
reported the presence of Reb homologs in a number of Proteobacteria 
and in the Bacteroidetes member Kordia algicida OT-1 (Akiba et al 
2010). 

Here, we have performed an exhaustive phylogenomic analysis of 
Reb homologs in currently available bacterial genomes. Reb homologs 
are widely distributed in members of Proteobacteria, comprising many 
free-living taxa as well as symbionts or pathogens of various eukar- 
yotes, including humans. The evolutionary history of reb genes 
appears very dynamic, involving vertical inheritance, horizontal 
gene transfers, and gene duplications. By combining phylogenetic, 
genome synteny, and genomic content analyses, we highlight a few 
potential candidate partners of Reb proteins. Finally, we found no 
clear signs of reb loci originating from defective prophages, or from 
recent transfer via mobile elements. Ecological implications are 
discussed. 



MATERIALS AND METHODS 
Homology searches 

Reb proteins (A-D) encoded in the plasmid pKAP298 from C. tae- 
niospiralis (AAR87077.1, AAR87076.1, AAR87131.1, AAR87075.1) 
were used as seeds to search for Reb homologs in the nonredundant 
protein database at the National Center for Biotechnology Informa- 
tion (NCBI). Homology searches were performed by BlastP (Altschul 
et al 1997) and all hits within an e- value cutoff of 1 x 10~^ were 
retained. PSI-BLAST and tBLASTn programs (Altschul et al 1997) 
also were used to search for highly divergent or misannotated Reb 
homologs. Searches were reiterated by using a number of seeds from 
various taxa. We also performed targeted searches against the meta- 
genome and the viral sequence databases at the NCBI, and against all 
eukaryotic genome sequence available at the Joint Genome Institute 
(http://genome.jgi.gov). Sequences were aligned using Muscle 3.8.31 
(Edgar 2004). Poorly aligned or divergent sequences were manually 
removed. Finally, HMM searches were performed with HMMER 3.0 
(hmmer.org) against a local databank of 841 complete bacterial 
genomes (only one representative per species), including 435 from 
Proteobacteria downloaded from the NCBI ftp Genomes server using 
a model built on the multiple alignment of all previously recovered 
Reb proteins, but no additional homologs were found. 

Sequence analysis 

Sequence secondary structures were predicted using PSIPRED (http:// 
bioinfcs.ucl.ac.uk/psipred/) (Buchan et al 2010). Alpha helical wheel 
diagrams were created using the tool created by Don Armstrong and 
Raphael Zidovetzki (http://rzlab.ucr.edu/scripts/wheel/wheeLcgi). Pre- 
dictProtein was used to search for additional structural features 
(http://www.predictprotein.org/). We searched for Reb homologs with 
available 3D structures using sequence based-PSI-BLAST (Altschul 
et al 1997) searches at the PFAM and Uniprot databases by using 
HHpred (http://hhpred.tuebingen.mpg.de/hhpred) and FFAS03 
(http://ffas.ljcrfedu/ffas-cgi/cgi/fifas.pl). WebLogo analysis was per- 
formed at http://weblogo.berkeley.edu/. 

Phylogenetic analysis 

The final dataset of 203 Reb homologs was aligned using Muscle 
3.8.31 (Edgar 2004) and trimmed using BMGE (Criscuolo and Gri- 
baldo 2010) with the less- stringent parameters (Blosum30), giving 
a dataset of 73 unambiguously aligned amino acid positions for phy- 
logenetic analysis. A Bayesian tree was obtained using Phylobayes 3.3 
(LartiUot et al 2009). (See figure legends for details on analyses.) 

16s rRNA sequences from Proteobacterial taxa representative of 
the diversity of this phylum were downloaded from the NCBI, as well 
as from the specialized Silva (http://www.arb-silva.de/) and the Ribo- 
somal Database Project (http://rdp.cme.msu.edu/) databases. Sequen- 
ces were aligned using Muscle 3.8.31 (Edgar 2004) and manually 
trimmed using the ED program of the MUST suite (Philippe 1993). 
Maximum likelihood trees were obtained using Treefinder (Jobb et al 
2004). (See figure legends for details on analyses.) 

Genome synteny and genome content analysis 

For genome synteny analysis, we retrieved the five open reading 
frames upstream and downstream of the reb loci (or extracted from 
contigs when the whole genome sequence was not available). Protein 
sequences were defined as homologous when sharing at least 40% 
similarity and less than 20% difference in length. Pairs of homologous 
proteins were then expanded to homologous protein families by in- 
cluding all proteins homologous to at least one member of the family. 
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Taxon Name 


Class 


Accession number of Reb homologues 


Azorhizobium cdulinodsns ORS 571 


Alpha proteobacteria 


VD nni Q7f;AQ7 vd nni q "jaaqh vd nni q')<;(;qq vd ^\^\^^^c'Jf\'> 

Tr UUIjZdO^/ Tr UUl_>ZDD"cS Tr UU1_>ZDDjj ir 


Azospirillum brdsilsnss Sp245 (m6g3pl3smid) * 


Alph3proteob3cteria 






Alphaprotcoboctcria 




Brcvundimonds sp> BAL3 


Alphaproteobacteria 




Brcvu ndi mon 3s subvibrioides ATCC 15264 


Alphaprotoobiictoria 




L3brcnzid 3lcxdndrii DFL*H 


Alphaproteobacteria 


7P n>iii7377 7p n^iif^i^nA 


Mdricdulis maris MCSIO 


Alphaproteobacteria 


VD scenic 

T r_/ DDUZD 


Oc63nic3ulis 3l6X3ndrii HTCC2633 


Alphaproteobacteria 


7D nnQ^?7Qn 7p nno^^^oc 


Polymorphum gilvum SL003B-26A1 


Alphaproteobacteria 


Art7AQQ^1 Art7AQQC;A Ar^7CQSC7 
MU^D7031 MUZOyoDD MUZD703/ 


r St ' u a 1. J vlj 1 I [ 1 -,fj rvj t*i[ t^j 1 


^ . — ■ — ■ — ■ 




Pseudovibrio sp. JE062 


Alphaproteobacteria 


7D ncnoiaoc 7d ncnoaciA 7d ncnsacaa 7d ncnsa^oo 7d ncnoascc 


Rhodob3cter3ce3e b3Ct6riuni HTCC2150 


Alphaproteobacteria 


Lr 


Rhodospirillum centenum SW 


Alphaproteobacteria 


VD nn77Q07nn vd nn77QQ7ni vd nn77Q07n7 vd nfi77QB7n7 vd nn77QB7n/i vd nn77Q07nt 

T r_UUZZyi5ZUU T r _UUZZyOZU 1 Tr UUZZytJZUZ Tr_UUZZ30ZU3 T r_UUZZ30Ztm Tr_UUZZ3oZU3 


Roseibium sp. TrichSKD4 


Alphaproteobacteria 


Lr U/DOlllU Lr U/OOlllZ Lr U/OOlll/ 


Ru6g6ri3 pomeroyi DSS-3 (meg3pl3smid) * 


Alphaproteobacteri3 


VP 1fi*i77n VP Ifi^i??^ VP 1f^^77A 


Ruegeria sp. TM1040 


Alphaproteobacteria 




Acidovorax avcnac subsp. avena6 ATCC 19860 


Beta proteobacteria 


VP nnA7a(;RAi; vp nnA7aARA(; vp (\(\&')'xe>nA^ vp nnA7af;RAR 


Burkholdcria ambifaria AMMD 


Beta proteobacteria 


VD 777ina VD 77ainji vd 77iinc vd 773ino 

Tr_/ /31Uj Tr_//3lU'» Tr_//31UO Tr_//3lUO 


Burkholderis ambifsris IOP40-10 


Betaproteobacteria 


ZP 02889068 ZP 02889069 ZP 02889070 ZP 02889073 


Burkholdcria ambifaria MC40~6 


Beta proteobacteria 


VP nniRn7Q4t; vp nniRn7Q4fi vp nniRn7QA7 vp nniRn7Q';n 

Tr_UUloU/^'tD Tr_UUioU/ytO T r_UU JLoU / / Tr_UUloU/3DU 


ourKnoiaena amDiTaria mca-s 


Betaproteobacteria 


Ijr Lr UZ^UboUU Lr UZsUaoUl Lr UZ^UsolM 


Burkholderia gladioli BSR3 


Betaproteobacteria 


ACAC^dA ACAC7dC ACAC^CIC ACAC7d7 ACAC3Q37 
ACAOZD14 HCAOZD13 ACAOZDIO ACAOZOl/ ACA0333/ 


Burkholdcria glumas BGRl 


Betaproteobacteria 


VD nn7ona7Ai vd n/i7ana7A7 vd nn7Qrto7Aa vd nn7Qno7AA vd nn7anQ7AC vd nn7ono7AC 


DurKnoiucrid maiici miv.v. iSitH 


Betaproteobacteria 


AAI l^ni 1 A 


RiirLKnIrlari'S mnlloi hirTC 'Xnt'iQ 

DurKnoiucrid mdiici n^iv. lu^z? 


Betaproteobacteria 


ARMn37AA 


niirLknMari-s m-xllai hlfTr ^ntA^ 

DurKnoiQcria maiici iuz4/ 


Betaproteobacteria 


ADI>IUoZ<tD 


Burkhold6ria mallei SAVPl 


Beta proteobacteria 


ABM49889 


Burkholderia oklahomensis C6786 


Bet3proteob3cteria 


7D n7aC17Q7 7D n7ad7Qa 7D n7aC170A 7D n7aC170C 

Lr UZ301Z7Z Lr UZ30IZ73 Lr UZoDIZt** Lr UZ901Z73 


Burkholderia oklahomensis E0147 


Bet3 proteobacteria 


7D n7aQA11Q7D n73QA11C7D n73QA117 7D n73CA11& 
Lr UZ99<lll9 Zr \i£SV*xXXi Lr MLiV*XXi Lr MLSo'*XXo 


Burkholderia pseudomallei 1106a 


Betaproteobacteria 


VD nmnt/it:c7 

Tr_UUlUD*l 33/ 


DU r K noKirr ui pscLiciorTiiiiici i / luo 


Bet J pr tit coh jc t cria 




Burkholderi3 pseudom3llei 305 


Betaproteob3cteria 


Lr Ul //U9DD 


Burkholderia pseudomallei 576 


Betaproteob3cteria 


7D naAQfldRI 


Burkholderia pseudomallei 668 


Beta proteobacteria 


VP 0010(^7^1^ 
T r_UUAU3/ 3X3 


Burkholderia pseudomallei K96243 


Betaproteobacteria 


VP inARQa 
Tr 11/0033 


Burkholderia sp. 383 


Betaproteobdcteria 


VD accccB VD acccco vd accc7n vd accc7a 
Tr_30booo Tr_300b0i» Tr_300b/U Tr_30bo/3 


Burkholderia sp. CCGElOOl 


Betaproteobacteria 




Burkholderia sp. CCGE1002 


Betaproteobacteria 




Burkholderia sp. CCGE1003 


Betaproteobacteria 


YP 003910733 YP 003910734 YP 003910737 


DUi KflUlUt:) Id bp. TIZJ 


Betaproteobacteria 




Burkholderia thailondensis E264 


Betaprotpobocteria 




Burkholderia thail3ndensis MSMB43 


Betaproteobacteria 


ZP 02461981 ZP 02461983 ZP 02461984 ZP 02461985 ZP 02461986 


Burkholderia ubonensis Bu 


Betaproteobacteria 


ZP 02381581 ZP 02381584 ZP 02381585 ZP 02381586 


Chromobacterium violaceum ATCC 12472 


Betaproteobacteria 


NP 900391 NP 900392 NP 900393 NP 900394 NP 900403 NP 901057 


Ralstonia syzygii R24 


Betaproteobacteria 


CCA8691S.1 CCA86916.1 CCA86917.1 CCA86918.1 


Ples'oc V'^ti'. pjcifici SIR-1 


Drlt.i[jrf)troh,icteria 


VI' ' 1, U02.1i382b Yf 002433827 YP 002433828 Yf 002433829 YP 002433830 

ZP 1 1912576 ZP 01912577 ZP 01912578 ZP 01912579 ZP 01912580 ZP 01912581 


Caedibacter taeniospiralis (plasnnid) ^ 


Gam ma proteobacteria 


AARfi7n7i; 1 AARR7n7(> 1 AARR7n77 1 


Marinomonas mediterranea MMB*1 


Gammaproteobacteria 


Ar»7Q1Qni Ar>7Q1Qnti Ar>7Q1Qnfi Ar^7Q1Qn7 Ar»7Q1Qnfl Ar>7Q1Q^^ An7Q1Q^A Ar>7Q1Q^«; An7Q1Q^R 
MUZ^l^^Ul ML'Z^l^UD MUZ^l^^UD MLfZ^lI^U/ MUZ^X^Uo MUZ!7XI733 MUZ^X!73*4 ML/Z^XZ733 ALJZ7X330 


Marinonfionas sp. MED121 


Gammaproteo bacteria 


7D nin77RnA 7D nin77Rn7 7D nin77Rnfl7D nin77fl/iQ7D nin77fiin7D nin77Rii 


Marinomonas sp. MWYLl 


Gammaproteobacteria 




Pseudomon3S 3eruginos3 LESB58 


G3mm3proteob3Cteri3 


VD nn^AaoRAO vd nn^AaoRcn vd nn7AaQflcc 

T r_UUZ'*330'*l7 Tr_UUZ*l3303U Tr_UUZ*»3 j033 


Pseudomonas aeruginosa M18 


Gammaproteobacteria 




Pseudomonas acru^tnosci NCGM2.S1 


GamrTiaproti^obactona 




Ps6udoinon8S seruginosa PACS2 


Gammaproteobacteria 


7D M 3AA7ftn 7D ni 3AA7fiA 7D fll 3AA7fl7 
tr UXSODZoll tr UXJDOZBD tr UX3DDZB/ 


Pseudomonas aeruginosa PA7 


Gammaproteobacteria 




Pseudomonas aeruginosa PAOl 


Gammaproteobacteria 




Pseudomonas aeruginosa UCBPP-PA14 


Gammaproteobacteria 


VP 7Rn^Q^ VP 7Qn^^A 
Tr_/3U333 Tr_/yU33'* 


Pseudomonas aeruginoss 39016 


Gammaproteobscteria 


7d n77QAian 


PsGudomonas fluoroscons F113 


Gammaprotoobactona 




Pseudomonas fluorescens Pf-5 


Gammaproteobacteria 


VP 7^7a7fi VP 7^7a7Q VP 7«;7^an 

Tr_Z3/3ZO Tr_Z3/3Z3 Tr_Z3/33U 


Pseudomonas fluorescens PfO-1 


Gammaproteobacteria 




Pseudomonas fluorescens SBW25 


Gammaproteobacteria 




Saccharoph3gus degr3d3ns 2-40 


Gammaproteobdcteria 


YP 528473 


Shew3nell3 denitrific3ns OS217 


G3mm3proteob3Cteria 


VD CCA1A7VD CCA1AOVD CCA1AOVD CCAICn 

Tr_3b*H»#/ Tr_3b«lX«»0 Tr_3b*H*»3 Tr_3b*»X3U 


Stenotrophomonas maltophilia K2798 


Gammaproteobacteria 




Stenotrophomonas maltophili3 R551-3 


Gammaproteobacteria 


VD nn7n7flAfia vd nrt7n7flAQA vd nn7n7flA0c vd nn7n7flAflc 

Tr_UUZUZ0'*03 T r_UUZUZO*IO»* T r_UUZUZO'lo3 T r_UUZUZOHOO 


Stenotrophomon3s sp. SKA14 


Gamm3proteob3cteria 


7D ncia70A7 7D nciacAB7 7D nciac7An 

Zr_U3X3Z3HZ Zr_U3X33'MJ/ Zr_U3X33/'KI 


V/ihrin <-nrall!ilu*iri ic ATff* RAA.AC/I 

viorio curaiMiiyiicus miv.v. dhm-*i9u 


Gammaproteobacteria 


7D n'^RR<^7QR 7P n^RR^7QQ 7P n^RR^ROn 7D n^RRQRnA 
Lr U3003/30 Zr U3003/39 Zr U30030UU Zr USOOSOU** 


Vibrio fischeri ES114 


Gamm3proteob3Cteria 


VP 7n7nR«; vp 7n7nQn vp 7n7nQi vp 707007 vp 7n7nQa vp 7n7n<)A vp 70700^ 

Tr_ZU/U03 Tr_ZU/U3U Tr_ZU/U3X Tr_ZU/U3Z Tr_ZU/U33 Tr_ZU/U3'* Tr_ZU/U33 


\/ihrin fitrhon ^^ 11 1 

viDrio Tiscnpn ivui i 


Gammaproteobacteria 




viDno nigripuicnriiuuO ailu ^/vh 


Gammaproteobacteria 


7D nfl73n010 7D nS71A073 7D nfl73na7A 7D nfi79rM7C 
Zr llB/3ll7l7 Zr ^JnlaM^tiLr MoHmt'* Lr MoiS\ni.O 


Xsnthomonas axonopodis pv. citrl str. 306 


Gdmmdproteob3Cteri3 


NP AA^^77 NP AA7^7A NP AA777<; NP AA^^7f^ NP f^A^aQA 
vir DH33Z3 \Hr Om33Z*4 vir DM33Z3 FMr Dm33ZD vir DH333D 


Xanthomonas axonopodis pv. citrumelo Fl 


Gammaproteobacteria 




Xanthomonas campestris pv. campestris str. 8004 


G3mm3proteob3Cteria 


YP 242290 


Xanthomonas campestris pv. campestris str. ATCC 33913 


Gamm3proteob3Cteria 


MD Aafl7CA 

iMr b3oZ30 


Xanthomonas campestris pv. campestris str. BlOO 


Gammaproteobacteria 




Xanthomonas campestris pv. raphani 756C 


Gammaproteobacteria 




Xanthomonas campestris pv. vesicatoria str. 85-10 


Gammaproteobacteria 


Tr_30»iy»o 






ZP 06486467 


Xanthomonas fuscans subsp. aurantifolii str. ICPB 11122 


Gammaproteobacteria 


ZP 06705918 


Xanthomonas oryzae pv. oryzae KACC 10331 


Gammaproteobacteria 




Xanthomonas oryzae pv. oryzae MAFF 311018 


Gammaproteobacteria 




Xanthomonas oryzae pv. oryzae PX099A 


Gamm3proteob3cteria 


YP_001914301 


Xanthomonas oryzae pv. oryzicola BLS256 


Gammaproteobacteria 


AEQ97370 


Xanthomonas perforans 91-118 


Gammaproteobacteria 


ZP 08187121 


Kordia algicida OT-1 


Bacteroidetes 


ZP 02161959 ZP 02161960 ZP 02161961 ZP 02161966 
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For genome content analysis, families of homologous proteins were 
built from 861 fully sequenced bacterial genomes downloaded from the 
NCBI ftp Genomes server. Protein sequences were defined as 
homologous if sharing at least 50% similarity and less than 20% 
difference in length. Pairs of homologous proteins were then expanded 
to homologous protein families by including all proteins homologous 
to at least one member of the family. 

RESULTS 

Taxonomic distribution 

Although the production of R-bodies has been observed in a few free- 
living bacteria (see Introduction), the effective distribution of Reb 
homologs in prokaryotes has not been clear. We carried out an 
exhaustive search for Reb homologs in current sequence databases 
(see Materials and Methods) (Figure 2). We found no additional 
homologs of RebC other than the C. taeniospiralis pKAP298 plasmid, 
indicating that this protein is specific to the Caedibacter Reb system. 
On the contrary, RebB, RebA and RebD are homologous and widely 
distributed. We identified 203 Reb homologs from 64 taxa belonging 
exclusively to Proteobacteria, with the one exception of Kordia algicida 
OT-1, which belongs to the phylum Bacteroidetes, as recently noticed 
(Akiba et al. 2010). Reb homologs are widely distributed among rep- 
resentatives of four of the six subdivisions of Proteobacteria, Alpha, 
Beta, Gamma, and Delta (Figure 3). We found between one and nine 
Reb homologs in each genome (Figure 2). Although reb genes were 
first identified on the C. taeniospiralis pKAP298 plasmid, we found 
Reb homologs on only two additional plasmids: the megaplasmid 
from the a-proteobacterium Ruegeria pomeroyi DSS-3, and the 
AZOBR_p4 plasmid from the a-proteobacterium Azospirillum brazil- 
ense Sp245 (Figure 2). The availability of a complete genome for these 
two taxa indicates that no additional homologs are present on the 
chromosome. We could not find any other Reb homologs in viruses, 
Archaea, or Eukarya, apart from four homologs from the Global 
Ocean Sampling marine metagenome sequence database that are 
closely related to Proteobacteria (not shown). 

Of importance, for three taxa with available sequence data, we 
could link for the first time the previously reported observation of R- 
bodies (see Introduction and references therein) with the presence of 
Reb homologs. In particular, R. centenum has six copies, A. avenae has 
four copies, and M. mediterranea has nine copies (Figure 2). Reb- 
containing taxa display a wide variety of lifestyles. Albeit many taxa 
harboring Reb homologs harbor free-living lifestyles in a wide variety 
of environments, from marine to terrestrial, a few taxa other than C. 
taeniospiralis appear to have an interaction with eukaryotes. For ex- 
ample, Pseudovibrio sp. JE062 is a symbiont of sponges; Vibrio fischeri 
ESI 14 is the specific bioluminescent symbiont in the light-emitting 
organs of certain squids and fishes; Labrenzia alexandrii and Oceanicaulis 
alexandrii have been isolated from dinoflagellates; Stenotrophomonas 
maltophilia R551-3, Azospirillum brasilense Sp245, and Pseudomonas 
fluorescens Pf-5 are plant growth— promoting endophytes; various strains 
of Burkholderia ambifaria are important in the biocontrol of pea plant 
phytopathogens; and Azorhizobium caulinodans is a nitrogen-fixing 



proteobacterium involved in mutualistic rhizobiale symbioses with 
plant roots. 

Other than in the algae pathogen Kordia algicida, we also found Reb 
homologs in a number of important pathogens of plants such as Acid- 
ovorax avenae subsp. avenae ATCC 19860; various strains of Xantho- 
monas; Burkholderia gladioli and B. glumae; Ralstonia syzygii R24; but 
also in important pathogens of aquatic animals such as shrimp and 
corals {Vibrio nigripulchritudo ATCC27043; Vibrio coralliilyticus 
ATCC-BAA450). Reb homologs were also found in the genomes of 
opportunistic pathogens of humans, such as various strains of Pseudo- 
monas aeruginosa, including the hypervirulent LESB58 strain; in various 
strains of Burkholderia pseudomallei, the causative agent of meilodiosis, 
in B. mallei, that causes glanders and pneumonia; and Stenotrophomonas 
maltophila, a rare but serious threat to patients who require catheteriza- 
tion. We also observed some interesting patterns by looking at the 
distribution of Reb homologs in closely related strains (Figure 2). For 
example, although the sponge symbiont Pseudovibrio sp. JE062 harbors 
reb genes, its closely related free-living relative Pseudovibrio sp. FO-BEGl 
does not. Similarly, Burkholderia thailandensis MSMB43, an opportunis- 
tic pathogen that causes meilodiosis, has reb genes, whereas the closely 
related B. thailandensis E264, a common soil and avirulent strain, does 
not (Figure 2). 

Sequence analysis 

Despite the widespread presence of Reb homologs in many bacterial 
taxa, it remains to be proven experimentally that these are responsible 
for producing R-bodies. However, some hints can already be gained 
from sequence analysis. 

Reb homologs are 95 amino acids long on average. They show 
good conservation at the sequence level and all display a basic alpha 
helical secondary structure with no significant structural difference 
among sequences (Figure 4A). Very little is known about the regula- 
tion and mechanism of R-body assembly [for review see, (Sanchez- 
Amat 2006)]. We could not observe any particular pattern in the 
sequences that allows distinguishing the equivalents of RebA, B, and 
D of C. taeniospiralis in other taxa, and this was also confirmed by 
phylogenetic analysis. Overall sequence conservation is high, suggest- 
ing that these Reb homologs are likely functional and conserved at the 
structural level. A WebLogo analysis highlighted highly conserved 
positions (Figure 4B) that may be important for R-body assembly 
and/or unrolling- rolling, and should be the target of choice of future 
mutation studies. Unfortunately, we could not identify any homolo- 
gous proteins with a solved crystal structure in extant databases (see 
Materials and Methods). Small proteins assembling into structures 
frequently display amphipathic helices, which consist of hydrophobic 
amino acids concentrated on one side and hydrophilic or polar amino 
acids on the other, and these can be highlighted using helical wheel 
diagrams (see Materials and Methods). However, we found no evi- 
dence that the helices of Reb homologs display amphipathic character 
(data not shown). Obtaining the 3-D structure of an R-body will 
therefore be essential to understand how Reb homologs assemble 
and function. 



Figure 2 Distribution of Reb homologs. Presence/absence of Reb homologs in proteobacteria and Kordia algicida. Colors indicate the different 
proteobacterial subdivisions. For each genome that harbors Reb homologs, we included complete genomes of closely related taxa without any 
reb genes, when available. Taxa with no available complete genome sequence but harboring Reb homologs are highlighted in gray. For these 
taxa, the presence of extra Reb copies cannot be excluded. When present, Reb homologs are indicated by their corresponding accession 
number. Reb homologs located on plasmids are indicated by an asterisk. See main text for discussion. 
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Bdellovibrio bacteriovorus M292759 Bdellovibrionales (1) 

Desulfurella acetivorans X72768 Desulfurellales (1 ) 

Desulfovibrio vulgaris af418179 Desulfovibrionales (30) 

Desulfuromonas michiganensis AF357915 Desulfuromonadales (1 5) 
Myxococcus xanthus fM202667 MyXOCOCCaleS (1/12) 



Syntrophobacter fumaroxidans X62874 Syntrophobacterales (3) 
- delta proteobacterium MLMS-1 AY459365 unclassified Deltaproteobacteria (1 ) 
Desulfobacter vibrioformis ^J^2254 Desulfobacterales (1/9) 



delta proteobacterium NaphS2 aj 132804 unclassified Deltaproteobacteria (1 ) 

Desulfarculus baarsii DSM 2075 AF418174 Desulfarculales (1 ) 

Magnetococcus sp. MC-1 cpooo47i unclassified Proteobacteria (1 ) 
Neorickettsia sennetsu M73219 Rickettsiales (60) 



■ Rhodospiriiium rubrumATCC 11170 CP000230 Rhodospiriliales (2/75) 

Micavibrio aeruginosavorus DQI86612 unclassified Alphaproteobacteria (1) 

Sphingomonas trueperi X97776 Sphingomonadales (58) 



a 



Kopriimonas byunsanensis DQ167245 



Kopriimonadales (0) 
Kiloniella laminariae AM749667 Kiloniellales (0) 

Sneathiella chinensis DQ219355 Sneathiellales (0) 

— Candldatus Punicelspirillum marinum IMCC1322 cpooi7§AR1 16(1) 
Candidatus Pelagibacter sp. /MCC9063 CP0025iiSAR1 1 (5) 

Kordiimonas gwangyangensis AY682384 Kordiimonadales (0) 

Parvularcula bermudensis AF544015 Parvularculales (1 ) 

Caulobacter fusiformis M007603 Caulobacterales (1/16) 

- Poiymorphum gilvum SL003B-26A1 CP002568 unclassified Alphaproteobacterla (1/2) 
Rhodobacter capsuiatus D\QA2% Rhodobacterales (7/103) 



Agrobacterium tumefaciens AJ389894 Rh izobiales (1/244) 
Mariprofundus ferrooxydans PV-l EF493243 Zetaproteobacteria (1 ) 
- Salinisphaera shabanensis AJ421425 Salinisphaerales (1 ) 



Xanthomonas campestris pv. campestris str. ATCC 33913 AE008922 XanthomonadaleS (1 2/51 ) 
- Acidithiobacillus ferrooxidans AF465604 Acidithiobacillales (13) 

Hydrogenophilus thermoluteolus AB009828 Hydrogenophilales (1 ) 

Methylophilus methylotmphus AB^9372A Methylophilales (8) 

Neisseria gonorrhoeae X077^ A Nelsseriales (1/71) 



y 

IP 



Car\didatus Procabacter sp. UWE2 afi 77424 Procabacteriales (0) 
Buriihoideria fur)gorum kf2'\57Q5 Burkholdehales (23/266) 

Nitrosomonas europaea AB070982 Nitrosomonadales (14) 

Rhodocyclus purpureus m34i 32 Rhodocyclales (8) 



Candidatus Accumulibacter phosphatis clade IIA str. C/lV-r CP001715 unclassified Betaproteobacteria (6) 

Gallionella capsiferriformans DQ386262 Gallionellales (2) 
Femtrophicum radicicola DQ386263 Ferritrophicales (0) 
■ Methylococcus capsulatus AJ563935 Methylococcales (4) 

Cardiobacterium valvarum AF506987 Cardiobacteriales (3) 

Chromatium okenii M223234 Chromatiales (26) 

gamma proteobacterium HdNl AF33i97»>inclassified Gammaproteobacteria (1) 

Legionella lansingensis z49727l-egionellales (27) 



Caedibacter taeniospiralis HR_0A2741 Thlotrlchales (1/35) ^^^^H 

Candidatus Ruthia magnifica str. Cm Calyptogena magnifica CP000488 sulfur-oxidizIng symbionts (5) 

I Pseudomonas psychnjphila AB04-\885 Pseudomonadales (5/144) 

81I gamma proteobacterium HTCC2207 AY386335 SAR92 Clade (1 ) 

gamma proteobacterium Mel5 EU672859 OM60 Clade (1 ) 
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marine gamma proteobacterium HTCC2143 AY386333 BD1-7 Clade (1 ) 
gamma proteobacterium IMCC2047 EF468718 unclassified Gammaproteobacteria ( 1 ) 
Oceanospirillum maris subsp. w////amsae aboo6763 Oceanospirillales (2/14) 
. Reinekea aestuarii GQ280347 unclassified Gammaproteobacteria (1 ) 

Aiteromonas marina AF 529060 Alteromonadales (2/73) 

Vibrio mediterranei X74710 Vlbrionales (3/1 37) 

— Aeromonas salmonicida subsp. salmonicida AY987751 Aeromonadales (23) 
Pasteurella pneumotropica AY362924 Pasteurellales (1 06) 
Candidatus Baumannia cicadeiiinicoia AF465794 unclassified Gammaproteobacteria (1 ) 
Escherichia coli X80725 Enterobacteriales (656) 
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When present in multiple copies, Reb homologs are clustered on 
the genome, mostly lying side-by-side or separated by a few in- 
tervening genes (Figure 5). We generally found only one reb locus per 
genome, with the exception of M. mediterranea whose nine homologs 
are organized into two different genomic regions, and Xanthomonas 
axonopodis and Chromobacterium violaceum, which both have an 
extra Reb homolog located far from the main cluster (Figure 5). Given 
their short length, the phylogeny of Reb homologs is globally poorly 
resolved, but a few monophyletic groups are apparent which are 
consistent with genomic synteny (Supporting Information, Figure SI 
and Figure 5). This allowed us to infer the evolutionary history of 
these Reb homologs (Figure 5). In some cases, Reb homologs from the 
same taxon are more closely related to each other than to Rebs of 
other taxa, suggesting that these have arisen from species-specific 
duplication events. 

This is for, instance, the case of five reb genes from Marinomonas 
sp. MED121, which are all more closely related to each other than to 
any other Reb (Figure 5). The same can be said for the six of the seven 
reb genes from Vibrio fischeri ESI 14 (Figure 5). In other cases, there is 
clear evidence for vertical inheritance of Reb proteins from the ances- 
tor of a specific Proteobacterial family (e.g., Xanthomonas) (Figure 5). 
In yet other instances, Reb proteins are most closely related among 
distantly related lineages, suggesting horizontal transfer of the whole 
locus, for example in the case of Marinomonas mediterranea MMB- 1 
and Shewanella denitrificans OS217 (Figure 5). Horizontal gene trans- 
fer was also suggested for extra reb copies in a few taxa (Figure 5). 
Finally, phylogeny could not help in assigning the equivalents in other 
taxa of the RebA-B-D of C. taeniospiralis, as these are more closely 
related among themselves (Figure SI). It is therefore difficult to make 
analogies between the previously reported data on the role of the 
RebA-B-D of C. taeniospiralis in the assembly process of its R-bodies 
and what occurs in the other taxa. Moreover, because we found no 
homologs of RebC outside C. taeniospiralis^ it is possible that other 
proteins have analogous function in Reb-harboring taxa. This finding 
would be consistent with RebC being a transcription regulator and 
therefore potentially species- specific. 

In search for potential partners of Reb proteins 

It has been shown that RebA, B and C from C. taeniospiralis are 
sufficient for production of type 51 R-body in E. coli but not for the 
killing phenotype (see Introduction) ^ which indicates that yet-uniden- 
tified partners coded on either the plasmid or the chromosome of C. 
taeniospiralis are involved in the killing. Interestingly, we found no 
homologs of the 63 proteins encoded in the C. taeniospiralis pKAP298 
plasmid in any of the Reb-harboring taxa. This may indicate that none 
of these proteins is a likely candidate for the killing toxin, which would 
be then encoded in the C. taeniospiralis genome (yet unavailable). 
Alternatively, the C. taeniospiralis toxin may well be on the plasmid 
but is not conserved in other bacteria, which may either display no 
killing activity or use nonhomologous toxins. 

To search for candidate partners of Reb proteins, we carried out 
a genome synteny analysis of the reb locus in 41 taxa for which 
a complete genome or sufficient genomic structure information is 



available (Figure 6, see Materials and Methods). Two mutually ex- 
clusive synteny patterns could be observed (hereafter referred to as 
Group 1 and 2, respectively; Figure 6). Strikingly, the genes included 
in these conserved synteny patterns are exclusively present in Reb- 
harboring taxa, strongly suggesting a functional link with the Reb 
system. The Group 1 synteny pattern is defined by four proteins 
annotated as hypothetical: HPl.l (red), HP1.2 (blue), HP1.3 (yel- 
low), and HP 1.4 (orange), which are only found in the surroundings 
of the reb locus and are only present in Reb-harboring taxa. The 
HP1.4 (-60-80 aa) and HP1.3 (!170 aa) proteins appear to be 
distant Reb homologs. However, they lack some of the conserved 
amino acid positions characteristic of other Reb proteins, and the 
HP 1.3 protein is approximately twice as long as a typical Reb (data 
not shown). The HPl.l (-360 aa) and the HP1.2 (-110-120 aa) 
proteins display no putative conserved domains. These four proteins 
often exhibit the same genomic organization (HP1.1,HP1.2,HP1.3, 
HP 1.4; Figure 6). In three cases, another hypothetical protein 
(HP 1.5, purple) is associated with this context, and is distantly re- 
lated to the HP 1.3 protein (Figure 6). 

The Group 2 synteny pattern is defined by the presence of two 
proteins annotated as hypothetical: HP2.1 (light blue) and HP2.2 
(fuchsia) (Figure 6). These proteins are approximately the same size 
(-205—220 aa) and display no putative conserved domains. They fre- 
quently co-occur, but their genomic organization varies in different taxa. 
An interesting characteristic of Group 2 synteny pattern is the frequent 
association with a putative RNA polymerase sigma-factor protein 
(HP2.3, light pink) and a transcriptional regulator/cyclic nucleotide 
binding protein (HP2.4, dark pink), which might be involved in tran- 
scription regulation of reb genes in these taxa. It should be noted that 
the conserved Group 1 and Group 2 synteny patterns are generally 
consistent with the Reb clusters highlighted by phylogenetic analysis 
(Figure 5) and have phylogenies similar to the Reb one (not shown), 
indicating a common evolutionary history and providing fiirther sug- 
gestion of a flinctional link between these proteins and Reb proteins. As 
an additional strong indication, the plasmid sequence of A. brasilense 
contains the four proteins characteristic of the Group 1 synteny pattern. 

A few taxa did not present any particularly conserved genomic 
context nor did they harbor the conserved genes found in Group 1 or 
Group2 synteny patterns. It is therefore possible that other genetic 
elements important for reb fiinction are located in different positions of 
the genome in these taxa. To this end, we sought to identify additional 
proteins specific to Reb-harboring taxa by carrying out a whole-genome 
content analysis (see Material and Methods). Using the complete 
genomes of 841 bacterial taxa — ^including 25 Reb-harboring complete 
genomes — ^we constructed protein families having at least 50% identity 
and 80% size conservation (see Material and Methods). This analysis 
confirmed that the only protein family exclusively present in Reb- 
harboring taxa is the Reb family itself, along with the protein fam- 
iUes specific to Group 1 and 2 synteny patterns (Figure 7). Although 
genomes of taxa not containing Reb homologs harbored distant 
homologs of the HP2.3 and HP2.4 (light pink and dark pink) pro- 
teins of genome synteny patterns, these fell outside of subfamilies 
which are exclusively present in Reb-harboring taxa. 



Figure 3 Distribution of Reb-harboring taxa across Proteobacteria. Unrooted Maximum likelihood phylogenetic tree of 1 6s rRNA sequences from 
60 taxa representative of proteobacterial diversity. Proteobacterial orders that include members containing Reb homologs are highlighted in red. 
The number of Reb-harboring taxa over the total number of available complete genomes is indicated in parenthesis. Caedlbacter taeniospiralis 
belongs to the gammaproteobacterial family of Thiotrichales (indicated by a red arrow). The tree was obtained using Treefinder with the J1 model 
of nucleotide substitution and a discrete gamma distribution with four categories to take into account among-site rate variation. Numbers at 
nodes indicate bootstrap values (BV) for 100 replicates of the original dataset. For clarity, only BVs greater than 50% are shown. The scale bar 
represents the average number of substitutions per site. 
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Caedibacter taeniospiralis AAR87076 rebB 
Caedibacter taeniospiralis AAR87075 rebD 
Caedibacter taeniospiralis AAR87077 rebA 
Vibrio fischeri ES114 YP_207092 
Burkholderia ambifaria AMMD YP_773104 
Azorhizobium caulinodans ORS 571 YP_001526698 
Kordia algicida OT 1 ZP_02161961 
Vibrio coralliilyticus ATCC BAA450 ZP_05885799 
Acidovorax avenae ATCC 19860 YP_004236846 
Marinomonas mediterranea MMBl ADZ 9 1907 
Burkholderia gladioli BSR3 AEA62615 
Vibrio nigripulchritudo ATCC27043 ZP_08730923 
Azospirillum brasilense Sp245 CCD03671 
Ralstonia syzygii R24 CCA86917 
Chromobacterium violaceum ATCC12472 NP 900393 



KSQITDSVTOTNTKILGEMPAFTTGSLMOMATOAAGLSIONSVTNOOOSNMLHOASTTQGMSILNAQAIGS^ 

NPOVTDSVTSNMMNVIGMAPAOAMGVLYOGVASTANLAIONAOSSSOOLNOIGOAVTSVACERIMG 

VNSOITDSVTOTNTMVLGSTPAQIMTNLMOMSAOANGLAMONAWGOKOTNMLSDAATTOGLSIMDGOAVST\ 
VNSOITDSVTOVNTQWGETPAMAMGNLOMSTSHALSNTAHNATSAOOOGOITMOAATVOGVNSLIGRVGEE] 
VNAAITDAVTOTNVKWGEAPAMATGSLYOAOSHSLSVLFENAVOOOSOAAVQLPTSTNVGTMOLAGAAATE] 
VSPOVTDAVTOVNVKWGEAPAMALGSLYOTAAHSTGLLFANAVTAONNONILAOAATTOGVMOIDAIAIVQ^ 
VNKOITDAVTOTNVQWADAPAIALANVYQTAAHSTGLMFANSVNAOSQQNILAOAATTOGILOIDAASIIKI 
VNEOVTDSVTQVNTKWGETPAMAMGNLLMSTSHALSNAAHNATAAOOQAOITMOAATVQGVNSLIGRGAEG] 
LNEEWSAVAIGNLKAISEOPAMLSNLAYSNTVANTNLSOONAVANOOAMNELGISIVAKASNTVARSAVDII 
VNEOITDSVTOVNTKWGETPAMAMGNLLMSTSHALSNAAHNATAAOOOAQITMOAATVQGVNSLIGRSSEG] 
VNSOITDSVTOANTKVLGDAPAIALGNLYOATAOALANAAHNATNAOOOSYVTAQAATTMGVATLTGVATSE] 
VNGOVTDSVTOVNTKWGETPAMAMGNLLMSTSOALGTAAHNATAGOOOAOITMOAATVQGVNSLIGRSAEGl 
VNDOITDSVTOANTKVLGDAPAIAMGNLFOATAOALANAAHNATNNOOOSYVTMOAATTMGVATLTGVATKS] 
VNSOITDSVTOANTKVLGDAPAIAMGNLYOATAOALANAAHNATNAOOOSYVTAOTATTMGVATLTGAATTG] 
3?NDQITDAVTQTNVKVLAEAPAMAMGSLFQTASHSTGILFENATSAQQQQNTLSQAAANMGVMQIDAAGTEja 




Figure 4 Sequence analysis. (A) Secondary structure of the RebB of C. taeniospiralis predicted by PSIPRED [http://bioinf.ucl.ac.uk/psipred 
(Buchan et a/. 2010)]. The same structure was substantially conserved in all other Reb homologs. (B) Conserved amino acid positions identified 
using Weblogo on an unambiguously aligned excerpt of the entire alignment of the 203 identified Reb homologs. For clarity, only 15 repre- 
sentative Reb sequences are shown. Position numbers refer to the RebB of C. taeniospiralis. 



These few conserved proteins might be either involved in regulation 
of R-body assembly and function, or represent the toxin, and should be 
priority targets for future studies. It v\dll also be interesting to test 
whether the Reb-harboring taxa that harbor none of these candidate 
partners are able to make R-bodies or display killing activity. Finally, it 
should be noted that none of these proteins belong to the Soj-ParA 
family or any annotated membrane-associated ATPase, weakening the 
previous hypothesis that these types of protein may represent the toxin 
responsible for killing (Jeblick and Kusch 2005). 

A phage origin? 

Jeblick and Kush emphasized the presence of phage-related genes on 
the reb carrying plasmid (Jeblick and Kusch 2005) and Preer (Preer 
et al 1974) observed an association of phage-like particles with R- 
bodies, suggesting that R-bodies may be encoded by defective phage 
genes. Moreover, the evolutionary analysis of Reb families (Figure 5) 
and their genomic context (Figure 6) suggest horizontal gene transfer 
events, for which bacteriophages are known to be major contributors. 
However, we found no Reb homologs in genomic sequences from 
phages. We therefore sought to see if Reb homologs are part of in- 
tegrated elements or prophages. We examined the 40 kbp on each side 
of the reb locus in the 25 Reb-harboring taxa for which complete 
genome sequences are available for the presence of prophages (in- 
tegrated phages) or other phage-related elements. First, we searched 
using the PHAST database [http://phast.wishartlab.com (Zhou et al 
2011)], which contains phage proteins that have been associated with 
a clear phage function. However, none of these regions were positive 



for prophage sequences. As a complementary analysis, we specifically 
searched a comprehensive local databank of 1130 bacteriophage 
sequences downloaded from GenBank (December 2011), which in- 
cluded 248 phages isolated from 33 proteobacterial genera. These 
regions did not display any specific similarity to phage elements. 
We also looked at whether reb loci are embedded in genomic islands 
by running searches on the IslandViewer server [http://www.pathoge- 
nomics.sfu.ca/islandviewer (Langille and Brinkman 2009)], which 
combines several prediction methods: (1) atypical dinucleotide con- 
tent; or (2) codon usage; (3) identification of unique regions not 
present in closely related genomes; and (4) presence of genes that 
are functionally related to mobile elements. However, none of the 
analyzed genomes displayed identified potential genomic islands ad- 
jacent to or surrounding the reb locus. We also verified from the 
literature whether Reb homologs were present in any previously 
reported genomic region of potential exogenous origin. For example, 
reb genes did not fall into any of the two atypical regions identified in 
the genome of Xanthomonas oryzae pv. oryzae PX099A (Salzberg 
et al 2008), nor in the four atypical regions highlighted in the genome 
of Xanthomonas campestris pv. campestris str. ATCC 33913 
(Vorholter et al 2003), and were not included in any of the prophage 
islands identified in the genome of the Pseudomonas aeruginosa hy- 
pervirulent LESB58I strain (Winstanley et al 2009). Finally, among 
1062 plasmid sequences available from Proteobacteria, we detected reb 
genes only on two plasmids other than the C. taeniospiralis plasmid 
pKAP298: the Azospirillum brasilense Sp245 plasmid AZOBR_p4 and 
the Ruegeria pomeroyi DSS-3 megaplasmid. 



512 I K. Raymann et al. 



•I^rGS* Genes | Genomes | Genetics 



Pseudomonas aeruginosa PACS2@ZP_01 366280 ^M^M 
Pseudomonas aeruginosa LESB58@YP_002439855 ^^^m 

Pseudomonas aeruginosa UCBPP PA14@YP_790355 ^™ 
Pseudomonas aeruginosa PACS2@ZP_0 1366286 ^™ 
Pseudomonas aeruginosa LESB58@YP_002439850 




Vibrio nigripulchritudo ATCC 27043@ZP 08730924 ^m^m^m 

Vibrio coralliilyticus ATCC BAA 450@ZP_05885799 ^m^m^m j| 
Xanthomonas axonopodis pv citri str 306@NP_643326 1 
Stenotrophomonas maltophilia R551 3@YP_002028486 


Pseudomonas aeruginosa UCBPP PA14@YP_790354 
Pseudomonas aeruginosa PACS2@ZP_0 1366287 
Pseudomonas aeruginosa LESB58@YP_002439849 
Pseudomonas aeruginosa 3901 6@ZP_077941 30 

Burkholderia thailandensis IVISMB43@ZP 02461986 ^H^H^H^H 
Burl<holderiaol^lahomensisE0147@ZP 02354118 ^mr- 
Burkholderia oklahomensis C6786@ZP 02361295 r 1 


] 


Acidovorax avenae sb. avenae ATCC 19860 @ YP_004236847 
Acidovorax avenae sb. avenae ATCC 19860 @YP_004236845 

Acidovorax avenae sb. avenae ATCC 19860 @YP_004236848 ^ 
Plesiocystis pacifica SIR 1@ZP_01912580 
Plesiocystis pacifica SIR 1@ZP_01912579 
Plesiocvstis pacifica SIR lOZP 01912578 


Burkholderia thailandensis MSMB43@ZP_02461985 t 1 1 

Burkholderia oklahomensis EO147@ZP_023541 17 b ,l 

Burkholderia oklahomensis C6786@ZP_02361294 ^H^K^HHH 


Plesiocystis pacifica SIR 1@ZP_01912576 
Plesiocystis pacifica SIR 1@ZP_01912575 
Plesiocystis pacifica SIR 1@ZP_01912570 


Burkholderia pseudomallei K96243@YP_1 06893 
Burkholderia pseudomallei 668@YP_001057315 

Burkholderia pseudomallei 576@ZP_03456481 1 1 
Burkholderia pseudomallei 305@ZP 01770366 1 1 
Burkholderia pseudomallei 1106a@YP_001 064557 1 1 
Burkholderia mallei SAVP1@ABM4g889 1 1 
Burkholderia mallei NCTC 10247@ABO06552 1 1 
Burkholderia mallei NCTC 10229@ABN03246 1=1 
Burkholderia mallei ATCC 23344@AAU50114 


Caedibactertaeniospiralis@AAR87077 
Caedibactertaeniospiralis@AAR87076 

Desulfatibacillum alkenivorans AK 01@YP_002433826 


Desulfatibacillum alkenivorans AK 01@YP_002433825 
Desulfatibacillum alkenivorans AK 01@YP_002433824 

MarinomonasspMED121@ZP 01077811 
MarinomonasspMED121@ZP 01077809 

MarinomonasspMED121@ZP 01077810 ■ — ^1 ^1 ^1 ^1 1 ■ 


Burkholderia oklahomensis EO147@ZP_02354116 ^^■^■^■^H 
Burkholderia oklahomensis C6786@ZP_02361293 ^■^■[^■^H 

Ralstonia svzvail R24ra)CCA86gi5 ^ 




Marinomonas sp MED1 21 @ZP_01 077808 
Marinomonas sp MED1 21 @ZP_01 077807 


Vibrio fischeri ES114@YP 207095 
Vibrio fischeri ES114@YP_207094 


Burkholderia thailandensis MSMB43@ZP_02461 983 V 
Burkholderia oklahomensis EO147@ZP_02354115 
Burkholderia oklahomensis C6786@ZP 02361292 

Burkholderia gladioli BSR3@AEA62617 ^m^^^m 4 


Vibrio fischeri ESI 14@YP 207092 
Vibrio fischeri ES114@YP 207091 
Vibrio fischeri ES114@YP_207090 


Burkholderia ubonensis Bu@ZP 02381584 ^H^H^H 
Burkholderia sp 383@YP_366670 ^m^m^m 
Burkholderia ambifaria MEX 5@ZP 02905801 ^H^B^H 
Burkholderia ambifaria MC40 6@YP 001807947 IH^H^H 
Burkholderia ambifaria IOP40 10@ZP_02889070 ^m^m^m 
Burkholderia ambifaria AMMD@YP_7731 05 ^H^H^H Ml 




Pseudomonas fluorescens Pf 5@YP_257330 ^^^^ '_~^^B 
Pseudomonas fluorescens Pf 5@YP_257329 


Chromobacterium violaceum ATCC 12472@NP_900394 

Chromobacterium violaceum ATCC 12472@NP_900392 
Chromobacterium violaceum ATCC 12472@NP_900391 


Burkholderia ambifaria AMMD@YP_773103 ^m^m^m 

Burkholderia sp 383@YP_366668 ■■^■1^1 

Burkholderia ambifaria MEX 5@ZP 02905799 ^■II^B^Ml^=.^=.^M 

Burkholderia ambifaria MC40 6@YP 001807945 ^ 1 I^H 

Burkholderia ambifaria IOP40 1 0@ZP_02889068 ^■■■^■(^^□^^^^H 
Burkholderia ubonensis Bu@ZP_02381586 ^B^B^IH^^I^^im 




Roseibium spTrichSKD4@ZP_07661117 ^m^J^M 
Polymorphum gilvum SL003B 26A1@ADZ69851 ^■^■1 

Labrenzia alexandrii DFL 11@ZP_05112322 ^Hia^^H ^ 

Marinomonas mediterranea MMB 1@ADZ91956 ^M^M^M^M 
Shewanella denitrificans OS217@YP 564150 ^ '^H 


Burkholderia ubonensis Bu@ZP_02381585 ^■^■^H^^I^^I^H 
Burkholderia sp 383@YP 366669 ^■^■^■CZ^I^^I^H 
Burkholderia ambifaria MEX 5@ZP 02905800 ^■^^■^■(^□[^^l^H 

Burkholderia ambifaria MC40 6@YP 001807946 l^^m^m 

Burkholderia ambifaria IOP40 10@ZP_02889069 '^m^ — ^^l^H 

Burkholderia ambifaria AMMD@YP_773104 ■ ^^l^B 


Shewanella denitrificans OS217@YP_564149 r B 
Marinomonas mediterranea MMB 1@ADZ91955 m B 

Shewanella denitrificans OS217@YP_564148 ^H^HHl^a 
Marinomonas mediterranea MMB 1@ADZ91954 ^H^^B^H^H 


Burkholderia ubonensis Bu@ZP 02381581 ^ 

Burkholderia sp 383@YP 366673 ^m^m^m ^9 
Burkholderia ambifaria MEX 5@ZP_02905804 1 1 
Burkholderia ambifaria MC40 6@YP_001 807950 ^m^m^m I I 
Burkholderia ambifaria IOP40 10@ZP 02889073 ^^m^m^m | 1 


Shewanella denitrificans OS217@YP_564147 

Marinomonas mediterranea MMB 1@ADZ91953 ^^I^H^H^H 

Vibrio nigripulchritudo ATCC 27043@ZP_08730925 ^m^m 
Vibrio coralliilyticus ATCC BAA 450@ZP_05885798 ^H^H 


Burkholderia ambifaria AMMD@YP_7731 08 i^^B^B I 1 

Xanthomonas gardneri ATCC 19865@ZP_08181385 

Xanthomonas campestris ATCC 33913@NP_638256 

Xanthomonas campestris str 8004@YP_242290 I 
Xanthomonas campestris pv vas. NCPPB 702@ZP_06486467 
Xanthomonas oryzae pv oryzicola BLS256@AEQ97370 
Xanthomonas oryzae pv oryzae PXO99A@YP_001914301 

Xanthomonas perforans 91 118@ZP_08187121 ^m^^ 
Xanthomonas campestris pv vesicatoria str 85 10@YP_364948 

Xanthomonas fuscans sb aurantifolii ICPB 11122@ZP_06705918 ZIT 
Xanthomonas axonopodls pv citri str 306@NP_643396 


Marinomonas mediterranea MMB 1@ADZ91908 ^H^H^H^H ^ 

Vibrio fischeri ES1 1 4@YP_207085 W^^m 
Vibrio nigripulchritudo ATCC 27043@ZP 08730919 I^^B^^B^^B 
Vibrio coralliilyticus ATCC BAA 450@ZpI05885804 

Marinomonas mediterranea MMB 1@ADZ91901 W 1 ^^H^H^H^H 

Oceanicaulis alexandrii HTCC2633@ZP 00953290 t 1 


KordiaalgicidaOT1@ZP 02161961 I^B Hi 

KordiaalgicidaOT1@ZP 02161960 

KordiaalgicidaOT1@ZP_02161959 

Azorhizobium caulinodans ORS 571 @YP_001 526702 

Azorhizobium caulinodans ORS 571 @YP_001 526698 



Figure 5 Evolutionary inference of Reb homologs based on phylogenetic analysis of the 203 Reb homologs (Figure S1). Here we have highlighted a few 
of the monophyletic groups. For each taxon, the genome locations of the corresponding Reb proteins are shown. Reb homologs highlighted in green are 
orthologs that were inferred to have been inherited through speciation events; those highlighted in blue represent paralogs issued from species-specific 
gene duplications; and those in red are the Reb homologs that have likely originated via horizontal transfer. Adjacent reb genes are indicated in gray. 
Open reading frames between reb genes are shown in white, and black slash-like symbols represent large intervening regions between reb genes. 



DISCUSSION 

Despite having been continuously and intensively studied from the 
1930s through the 1980s, recent data on R-bodies have been scarce. 
With a whole array of novel technology, studies on the diversity and 
role of these puzzling bacterial structures can now be fully tackled. 
Our exhaustive analysis shows that Reb homologs are widely present 
in Proteobacterial genomes spanning the diversity of this major bacterial 
phylum, indicating that they are much more widespread than pre- 
Adously known. In the perspective of obtaining experimental data, our 
analysis remains for the time being largely descriptive, but nevertheless 
provides a number of interesting hints for discussion and future work. 

Sequence analysis suggests structural and functional conservation, 
indicating that Reb homologs are likely responsible for the production 
of functional R-bodies in all the taxa where we found them, although 
this needs to be verified experimentally. Moreover, the presence of 
Reb homologs in bacteria where R-bodies have been previously 
observed is already a good hint. The occurrence of R-bodies in a wide 
range of bacteria harboring Reb homologs should be tested, with 
priority given to those that have medical, agricultural, and ecological 
implications. Our data will also help direct mutational studies to 
characterize the system further through structural and functional 
analysis of reb genes from Caedibacter but also other taxa, including 



those that harbor multiple reb genes and those that have only one 
copy. Also, it will be important to verify the involvement in R-body 
production, assembly, regulation, and killing of the likely partners that 
we have identified by genome context and whole genome content 
analysis. Because none of the Reb-harboring genomes identified in 
this study possess homologs of the genes carried by the C. taeniospir- 
alis plasmid, it is possible that the killing toxin is encoded in the C. 
taeniospiralis genome and is perhaps one of the candidate genes that 
we have identified. The completion of this genome wiU therefore be 
very important. Another possibility is that the R-bodies are delivery 
systems for species- specific toxins. 

Our analysis of taxonomic distribution shows that Reb homologs 
are present in taxa displaying very different lifestyles, suggesting that 
the role of R-bodies in nature could be quite diverse. Moreover, we 
show that intact reb loci have been spread among Proteobacterial taxa 
via horizontal transfer, indicating that an advantage exists in acquiring 
and keeping R-bodies. However, we found no clear signs of a phage 
origin for the reb loci. It remains possible that this is due to an under- 
sampling of phages from the Reb-harboring taxa or that these hori- 
zontal transfers correspond to events old enough to have allowed 
sequences to adapt to the new genome, or that all traces of the transfer 
vectors have been erased from the genome after transfer. 
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Figure 6 Genome synteny analysis of the reb locus mapped onto an Unrooted Maximum likelihood phylogenetic tree of 16s rRNA sequences 
from the 64 reb-containing Proteobacteria. The tree was created using Treefinder with the GTR model of nucleotide substitution and a discrete 
gamma distribution with four categories to take into account among-site rate variation. Numbers at nodes indicate BVs for 100 replicates of the 
original dataset. For clarity, only BVs greater than 50% are shown. The scale bar represents the average number of substitutions per site. Species 
where Reb homologs are located on a plasmid are marked by a black circle. A white star in a red circle marks the fully sequenced genomes used in 
the analysis. Reb homologs are shown in green. Homologous genes are represented by the same color. For clarity, only genes discussed in the 
text are indicated. Black slash-like symbols represent large regions in between genes. The RebC of C.taeniospiralis (AAR87131) is shown in light 
green and outlined in black to indicate its lack of homology with the other rebs. The genome context for the Flavobacterium K. algicida is shown 
separately. 



Some of the Reb-harboring taxa have very important ecological, 
agricultural, and medical relevance. In addition, by observing the 
pattern of presence/absence of reb genes in closely related strains, we 
found intriguing links between virulence and presence of reb genes 
that will surely be worthy of further investigation. R-bodies may be 
involved in mediating interactions of these bacteria with eukaryotic 
cells, perhaps through the triggering of unrolling when ingested in the 
vacuole, similarly to what observed in the Caedibacter/Paramecium 
interaction. 

We found Reb homologs in many free-living bacteria. R-bodies in 
these bacteria may be used as a defense mechanism against grazing by 
eukaryotes. Bacteria have in fact developed various strategies against 
protozoan predation in nature (Matz and Kjelleberg 2005). Many 



examples have been reported of cytotoxicity responses against grazing 
by eukaryotes in different bacteria such as Pseudomonas (Matz et al 
2004; Weitere et al 2005) and Vibrio (Erken et al 2011). Also, it is 
tempting to speculate that some of these free-living, Reb-harboring 
taxa can establish transient "killing" symbioses with ciliates or other 
protists, as seems to be indicated by the fact that some Reb-harboring 
taxa have indeed been isolated from aquatic microbial eukaryotes. It 
will be interesting to perform feeding experiments to test their killing 
potential in Paramecium and also other protistan taxa, such as algae. 
Indeed, we confirmed the presence of reb genes in Kordia algicida^ 
a planktonic bacterium recently highlighted as a killer of diatoms by 
a yet unclear mechanism involving an unidentified protease (Paul and 
Pohnert 2011). Our study suggests that delivery of this protease could 
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Figure 7 Whole genome content analysis. Graphical representation of 
protein families created using the R software (R Development Core 
Team 2011). The 25 fully sequenced Reb-harboring taxa are repre- 
sented on the X-axis and the other 816 fully sequenced bacterial taxa 
analyzed are represented on the y-axis. Each point on the graph 
represents a protein family (see Materials and Methods for details on 
how protein families were defined). For example, the Reb family (in- 
dicated by a green box) is present in 24 fully sequenced Reb-harboring 
taxa but in none of the remaining genomes. The 4 Rebs of Acidovorax 
avenae subsp. Avenae ATCC 19860 did not fall into the Reb protein 
family because they are very divergent (see Figure 5 and Figure S1). The 
other five unique protein families specific to Reb-harboring taxa (see 
main text) are shown with boxes corresponding to colors as defined in 
the legend to Figure 6. 



be performed via R-bodies, which would therefore be important play- 
ers in the regulation of algal blooms. Similarly, it would be interesting 
to verify whether the presence of Reb homologs in the powerful coral 
pathogen Vibrio coralliilyticus is linked to R-body production and if 
these are somehow involved in delivery of the killing toxin. The killing 
factor produced by K. algicida is triggered independently of the pres- 
ence of the diatom target but rather likely depends on a quorum 
sensing mechanism when the population size reaches a certain density 
(Paul and Pohnert 2011). Similarly, the triggering of R-body produc- 
tion in a fraction of the Caedibacter endosymbiont population in 
Paramecium, a phenomenon that is not yet understood, may be reg- 
ulated by a quorum sensing mechanism. 

In addition, our study suggests that R-bodies may be involved in 
the interaction of Proteobacteria with several multicellular organisms, 
such as plants and animals. The recently reported involvement of reb 
genes in the regulation of a rhizobial symbiosis (Akiba et al. 2010) and 
the presence of Reb homologs in a number of Proteobacteria known 
to interact with plant roots is intriguing, and it is not excluded that R- 
bodies may help the bacterium to move through plant tissues, via 
delivery of specific lytic compounds. Indeed, we found Reb homologs 
in a number of bacterial strains known to be able to penetrate the 
xylem of plants. A similar mechanism may be used to move through 
tissues by some Proteobacteria that interact with animals, such as 
Vibrio fischeri with its squid host. Finally, the presence of reb genes 
in important pathogens of eukaryotes, including humans, some of 
which are responsible for emerging and poorly characterized infections. 



should prompt the study of their potential involvement in the in- 
fection process, perhaps by helping tissue invasion. 

If our predictions are verified, bacteria may represent a largely 
overlooked role in the regulation of microbial eukaryotic abundance 
and distribution, in addition to the much more studied impact of 
viruses. This regulation may be performed at different levels, by direct 
killing of eukaryotic grazers, but also by providing mechanisms used for 
defense among eukaryotes, as is the example of the Paramecium/ 
Caedibacter symbiosis. Interestingly, it was recently reported that the 
thricocysts of eukaryotic algae belonging to the Cryptomonads, ejectile 
organelles that are probably used with a defensive role against predation, 
are composed of four proteins that share similarity with Reb proteins 
(Yamagishi et al. 2012). The authors proposed that these proteins were 
acquired horizontally from Proteobacteria. R-body— harboring bacteria 
could therefore play a larger role in the origin and spread of defense 
mechanisms in eukaryotic microorganisms. Finally, elucidation of the 
mechanism of roUing/unroUing/toxin delivery of R-bodies wiU surely 
open the way to interesting biotechnological applications. 
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